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ABSTRACT 


A generalized  maximum  entropy  principle  is  described  for  dealing  with  de- 
cision problems  involving  uncertainty  but  with  some  prior  knowledge  about  the 
probability  space  corresponding  to  nature.  This  knowledge  about  the  proba- 
listic  structure  is  expressed  through  known  bounds  on  event  probabilities  and 
moments,  which  is  incorporated  into  a nonlinear  programming  problem.  The 
solution  provides  a maximum  entropy  distribution  which  is  then  used  in  treating 
the  decision  problem  as  one  involving  risk. 

An  example  application  is  described  that  involves  the  selection  of  oil 
spill  recovery  systems  for  inland  harbor  regions.  Other  areas  of  application 
are  identified  and  tables  of  some  majiJ_um  entropy  distributions  resulting  from 
a variety  of  moment  constraints  are  provided. 
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An  example  application  is  described  that  involves  the  selection  of  oil  spill 
recovery  systems  for  inland  harbor  regions.  Other  areas  of  application  are 
identified  and  tables  of  some  maximum  entropy  distributions  resulting  from 
a variety  of  moment  constraints  are  provided. 


This  paper  describes  a general  maximum  entropy  principle  (GMEP)  for  deal- 
ing with  uncertainty  within  a decision  analytic  framework.  These  decision 
analysis  problems  are  quite  common,  and  in  fact  are  fundamental,  in  operations 
research  practice  where  one  is  trying  to  make  the  most  out  of  some  limited 
knowledge  and  resources.  Entropy  is  a measure  of  dispersion  that  is  widely 
used  for  quantifying  uncertainty.  Stemming  from  early  work  by  Shannon  [18]  in 
communications  theory,  it  has  been  applied  rigorously  in  psychology  and  en- 
gineering for  quantifying  complexity  and  various  indices  of  workload  related 
to  human  performance  (e.g.  see  Thomas  [22]).  Recently  it  has  been  used  as  a 
performance  measure  in  areas  of  business  and  economics  [1,  4,  9,  10,  21,  27], 
behavioral  sciences  [8,  20],  and  physical  sciences  [7,  23].  Mathematical  en- 
tropy as  a criterion  for  resolving  choices  among  alternatives  has  received 
acceptance  in  location  and  distribution  analysis  [12,  26],  the  design  of  ques- 
tionnaires [5],  statistical  inference  [6,  11],  and  to  an  extent  in  search 
problems  [13]. 

Before  we  discuss  the  maximum  entropy  principle  we  shall  present  a basic 
formulation  of  a decision  analysis  problem.  The  general  principle  involves 
deriving  a maximum  entropy  distribution  for  given  knowledge  about  the  set  of 
states  of  nature.  This  derivation  as  stated  or  alluded  to  by  others  seems  to 
require  that  the  given  knowledge  be  limited  to  that  which  can  be  expressed 
through  the  moments  of  an  associated  random  variable.  Here  we  relax  this  re- 
striction somewhat  by  allowing  for  information  through  bounded  event  probabili- 
ties and  moments.  A numerical  example  is  given  to  demonstrate  application  of 
the  maximum  entropy  principle  in  making  selections  among  alternative  oil  pol- 
lution abatement  systems.  There  are  a number  of  problems  in  systems  design 
and  evaluation  for  which  this  principle  offers  a pragmatic  approach.  Some  of 
the  more  common  areas  of  operations  research  are  summarized  followed  by  some 
concluding  remarks. 
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a set  of  possible  future  states  of  nature  ft  to  which  a set  of  values  1/  is 
associated.  Both  A and  ft  are  non-empty  with  1/  on  A x ft.  For  our  pur- 
poses here,  we  shall  restrict  A to  be  finite.  Let  X be  a random  variable 
that  associates  points  of  ft  to  values,  x,  on  the  real  line  and  has  an  under- 
lying distribution  P.  Thus,  our  decision  problem  is  to 


am|X^  / ^(a,x)  dP(x)  , 


(1) 


where  \!  is  Lebesque  integrable. 


The  problem  is  one  under  risk  whenever  P is  known.  More  commonly  in 
practice,  however,  one  must  deal  with  decisions  under  uncertainty  where  P 
cannot  be  specified.  The  normal  procedure  for  dealing  with  decision  problems 
under  uncertainty  is  to  select  a principle  of  choice,  e.g.  Maximin,  and  ef- 
fectively make  assumptions  about  the  probabilities  of  events  over  ft. 

2.  A MAXIMUM  ENTROPY  PRINCIPLE. 

One  way  of  solving  the  decision  problem  under  uncertainty  is  to  assume  a 
distribution  for  X and  then  solve  (1).  This  procedure  is  implicit  with  the 
Laplace  Principle  of  choice  where  one  assumes  the  states  of  nature  are  equally- 
iikely  to  occur.  In  this  case  the  decision  maker  presumably  views  nature  such 
that  pure  random  chance  is  the  only  vital  operant  and  the  only  perceived  know- 
ledge of  P is  the  range  of  the  random  variable  X.  The  fact  that  different 
decision  makers  percieve  information  pertaining  to  P differently  accounts 
for  the  variety  of  principles  that  are  employed  in  such  analyses.  Any  process 
of  making  assumptions  about  P should  account  for  all  of  the  known  facts  and 
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the  perception  of  chance  to  the  decision  maker. 

Assuming  Distributions  with  Known  Moments. 

A systematic  procedure  for  assuming  distributions  that  has  evolved  over 
the  past  several  years  [18]  is  to  derive  a distribution  that  maximizes  the 
entropy,  subject  to  a set  of  moment  constraints  that  reflect  known  facts  about 
P.  This  involves  solving  the  optimization  problem 

max:  H(P)  = - / log(dP(x) /dx)  aP(x)  (2) 

s.t.  / dP(x)  = 1 , dP(x)  0 

/ £k(x)dP(x)  =pk  , k = 1,  2,... 

where  H(P)  is  the  entropy  associated  with  the  random  variable  X and 

is  an  integrable  function  of  X with  constant  for  k = 1,  2 An 

intuitively  appealing  interpretation  of  H(P)  is  that  it  is  a quantitative 
index  of  the  amount  of  information  that  would  be  required  in  order  to  specify 
outcomes  from  with  certainty.  Solutions  to  (2)  can  be  obtained  using 
conventional  techniques,  such  as  the  calculus  of  variations. 

We  note  that  there  are  at  least  two  constraints,  since  P is  a probability 
distribution  function  and  hence  dP  must  be  nonnegative  and  sum  (or  integrate) 
to  1.  It  can  easily  be  shown  that  with  these  constraints  plus  knowledge  that 
the  range  of  X is  a finite  interval  of  R1,  the  maximum  entropy  distribution 
is  the  uniform.  Now  if  X is  continuous  on  (0,  °°)  with  £^(x)  = x and 
Cj  (x)  =■  0,  for  every  j 1,  then  the  corresponding  solution  to  (2)  is  the  ex- 
ponential. These  and  other  maximum  entropy  distributions  are  given  in  the 
Appendix  for  some  cases  where  prior  knowledge  about  P can  be  specified  through 
moments  of  X.  We  note  that  this  procedure  is  actually  conservative,  since  it 
generates  distributions  that  have  maximum  dispersion.  Smith  [19]  discusses 
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the  equivalence  of  the  Maximum  Entropy  and  Minimax  Principles  under  certain  con- 
ditions. 
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Assuming  Distributions  with  Known  Bounds  on  Moments. 

It  is  contended  that  in  practice  there  is  usually  some  knowledge  about  the 
states  of  nature  for  a decision  problem,  but  it  cannot  be  expressed  directly 
in  the  form  of  the  constraints  in  (2) . This  knowledge  rarely  extends  beyond 
the  first  two  moments  and  it  is  usually  in  the  form  of  known  bounds  on  par- 
ticular event  probabilities  and  moments.  Let  us  now  restrict  ourselves  to 
an  X that  takes  on  discrete  values,  X2»...  with  probabilities  p^ , p2>... 

respectively.  The  problem  of  assuming  a distribution  over  £2  is  now  stated 
as 


max:  H(P)  = - l p.  log  p 

P i# 

s.t.  I p,  = 1 , p.  > 0 

i 1 


(3) 


jk 


- WPi^jk 


j = 1,  2,. 


0,  1,. . . 


where  1^  is  a set  of  integers  with  and  6^  constants  and  Cq(x^)  = 1 

for  each  i.  The  solution  to  (3)  is  not  generally  obtainable  in  a simple  closed 
form.  It  can,  however,  be  solved  numerically  without  difficulty.  We  observe 
that  (3)  is  a standard  nonlinear  programming  problem  of  the  form 


max  H(P) 
P 


Pig 


where  H(.)  is  concave  and  separable  and  the  constraints  are  all  linear  in 
p.  This  problem  can  be  solved  by  the  conventional  methods  of  Lagrangian  penal- 
ties, reduced  gradient,  local  linearization,  or  separable  programming. 


Once  (3)  is  solved  and  a discrete  probability  function  p is  determined 
then  the  remaining  problem  is  to  select  an  optimum  alternative  from  the  set  A. 

As  mentioned  previously,  the  decision  problem  is  now  treated  as  one  involving 
risk.  Letting  X take  on  values  x ^ , a^  € A,  and  v ■ v(a^»  Xj)» 

Vi  = 1,...,  m;  j = 1,...,  n,  our  discrete  analog  to  (1)  is  to  chose  a^  such 
that 

EV (a* , X)  = max  [ p.v  (4) 

1 i j 2 3 

3.  A NUMERICAL  EXAMPLE. 

We  shall  demonstrate  the  GMEP  by  application  to  an  oil  spill  abatement 
planning  problem  that  is  of  current  concern  to  the  U.S.  Naval  Facilities  En- 
gineering Command.  Although  preventive  policies  and  procedures  are  enforced, 
accidental  and  unavoidable  oil  spills  do  occur  randomly  in  both  at-sea  and  in- 
harbor waterways.  It  is  therefore  necessary  that  clean-up  capabilities  be 
provided  to  remove  these  spills  from  the  water.  A decision  wcs  made  at  the 
beginning  of  this  decade  to  improve  our  capabilities  of  dealing  with  these  spills. 
The  program  development  for  harbor  oil  spills,  which  is  of  interest  here,  is  to 
develop  systems  for  removing  such  spills  efficiently  and  economically.  An  in- 
termediate effort  calls  for  the  utilization  of  the  "best"  commercially  available 
equipment  and  hardware,  which  are  to  be  refined  with  Navy-developed  techniques 
for  longer  range  efforts.  Decision  problems  exist  in  specifying  combinations 
of  equipment,  policies,  and  manpower  to  cope  with  oil  spills  occurring  in  these 
harbor  regions. 

Formulation. 

The  problem  of  concern  is,  for  a fixed  region,  to  make  an  optimal  selection 
from  among  the  alternative  systems  available  for  cleaning  up  oil  spills.  The 
states  of  nature  are  described  in  Table  I according  to  the  environmental  condi- 
■}.  tions,  number  and  volume  of  spills,  and  relative  confinement  of  the  spilt  oil. 
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Although  there  are  many  factors  that  affect  clean-up  performance,  these  are  known 
to  be  the  most  pertinent  [28].  Here  we  have  simplified  our  problem  by  aggregating 
these  conditions  and  defining  only  twelve  states.  For  example;  the  environ- 
mental conditions  include  such  factors  as  wind  and  current  velocity,  sea  state, 
air  and  water  temperature,  wave  height,  and  many  other  factors  that  influence 
clean-up  performance.  Moreover,  the  relative  effects  of  these  influences  vary 

j 

from  time  to  time  and  seasonally.  For  our  purposes  here,  we  are  aggregating 


these  factors  into  "GOOD"  and  "FAIR/POOR"  classes  and  neglecting  any  time  varia- 
tions. Similarly,  we  assume  there  is  a known  threshold  for  the  number  of  spills 
that  separates  the  frequency  classes  "FEW"  and  "MODERATE."  Accepted  ranges 


have  been  established  for  small,  medium,  and  large  volume  spills  as  0-100,  100- 


1000,  and  above  1000  gallons,  respectively  [28].  This  classification  applies 
to  both  pier side  and  open  area  harbor  oil  spills. 


Every  choice  of  clean-up  capability  must  include  the  functions  of  contain- 
ment, removal,  storage/transfer , and  separation.  Although  combinatorially  a 
large  number  of  potential  alternative  systems  result  from  combinations  of  the 
commercially  available  subsystems,  the  set  can  be  reduced  to  a reasonable  num- 


ber. Many  of  these  subsystems  have  already  been  evaluated  in  terms  of  their 
potential  as  alternatives  (Widawsky  [25]).  The  following  four  alternatives 
typify  the  ultimate  decision  problem  for  a particular  region: 

a^  = contract  all  clean-up  activities. 

a 2 = contract  for  pierside  clean-up  and  procure  equipment 
set  A for  open  area  spills. 

a ^ = procure  equipment  set  B for  pierside  clean-up  and 
contract  for  open  area  spills, 
a^  = procure  equipment  set  C for  all  spills. 

The  decision  matrix  for  this  problem  is  given  in  Table  II,  which  contains 
the  corresponding  values  for  combinations  of  these  4 alternatives  and  the  12 
states  of  nature  from  Table  I.  Here  the  matrix  entries,  v„  , represent 
average  equivalent  annual  costs. 

TABLE  II.  Value  Matrix 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

10 

15 

18 

18 

18 

21 

17 

20 

32 

25 

40 

10 

15 

18 

18 

18 

21 

20 

24 

26 

24 

28 

12 

8 

12 

10 

10 

14 

17 

20 

32 

25 

40 

10 

12 

15 

14 

12 

14 

14 

18 

28 

32 

38 

As  previously  mentioned,  one  usually  has  more  information  about  a decision 
problem  than  that  given  in  Table  II.  For  example,  in  a particular  region  it 
may  be  known  that  infrequently  occurring  small  and  medium  volume  pierslde  spills 
during  good  environmental  conditions  are  more  probable  than,  say,  the  remain- 
ing 11  states  described  in  Table  II.  This  translates  to  the  constraint: 
p.^  < p^,  Vi  = 2,...,  12.  It  might  further  be  known  from  experience  that 
p^  £ p2>  V i = 3,...  12  and  that  p^  lies  somewhere  between  Q.25  and  0.60. 

Let  us  further  typify  our  numerical  example  by  including  this  and  additional 
information  in  the  following  set  of  constraints: 


pi  1 pp  1 = 3 12 

Pi  + P2  - p3  + p5  + p6 

p5  i p6 
p10  1 p7 
p9  1 p8 
p12  i pll  + p6 
P7  ± P10+  Pll+  P12 


.25  < p2  £ .50 

•10  1 p3  1 *40 

0 < Pi  < .30,  i - 4,  11,  12 

0 < Pi  < .50,  1=5,  . . .,9 

0 1 Pio  1 • 20 


Solution  by  GMEP . 

A local  stepwise  gradient  descent  algorithm  of  the  feasible  directions 
class  [2]  was  used  to  obtain  the  solution: 


p^*  = .250,  p2*  = .250,  pj*  - .100,  p^*  = .047 
p5*  = .044,  p6*  = .051,  p?*  = .082,  p8*  <•  .048 
p9*  = .047,  p1Q*  = .027,  pu*  = .027,  p12*  = .027 

with  H(p*)  = 2.144.  Thus,  from  (4)  we  have  for  the  data  of  Table  II. 

EV (a* ,X)  = min  [17.7,  17.0,  14.8,  15.3]  * 14.8 
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Since  the  v^  values  are  costs  in  this  case,  we  must  minimize  EV(a^,X).  So 
for  this  region  the  appropriate  choice  of  alternative  is  a^,  contract  all  open 
area  spills  and  procure  equipment  B for  pier side  spills,  based  on  the  ME  Principle. 

Obviously,  there  are  other  relevant  factors  that  have  been  omitted  in  this 
example.  The  actual  selection  of  harbor  oil  spill  clean-up  systems  involves 
many  states  of  nature,  some  of  which  are  peculiar  to  given  regions.  It  is  also 
worth  noting  that  in  practice  the  problem  of  solving  (2)  or  (3)  cannot  be 
separated  from  a statistical  estimation  problem. 

4.  OTHER  AREAS  OF  APPLICATION  AND  FINAL  REMARKS. 

The  GMEP  described  is  a useful  method  for  analyzing  decision  problems 
under  uncertainty  with  bounds  on  event  probabilities  and  moments.  For  this  rea- 
son it  is  particularly  amenable  to  problems  in  systems  design  and  evaluation 
whereby  knowledge  about  the  future  states  of  nature  is  sparse  and  generally  quite 
limited.  Such  is  the  case  for  the  oil  spill  recovery  system  selection  problem 
ip  the  numerical  example.  We  shall  now  summarize  four  conventional  areas  of 
operations  research  within  which  the  GMEP  offers  potential  for  systems  evaluation. 
Location  Analysis. 

A basic  problem  in  location-allocation  theory  is  to  find  optimum  locations 
for  a set  of  new  facilities  with  respect  to  a set  of  existing  ones.  One  selects 
an  arrangement  that  results  in  a mini’-  cost  associated  with  making  transitions 

among  facilities.  Although  deterministic  versions  of  this  problem  have  been 
studied  extensively,  the  more  realistic  stochastic  version  has  received  little 
attention  (Weber  [3],  Cooper  [24],  Seppala  [17]).  The  transitions  among  facili- 
ties result  from  demands  that  are  random  and  are  typically  assumed  to  be  normally 
distributed  for  analytical  convenience.  We  argue  that  there  is  usually  some 
knowledge  about  these  demands  and  that  it  can  be  expressed  in  the  form  of  the 
constraints  in  (3).  Hence,  a reasonable  generalization  of  the  location  prob- 
lem is  to  incorporate  maximum  entropy  demand  distributions. 
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Inventory  Systems. 


Another  area  within  which  system  performance  and  effectiveness  are  largely 
affected  by  randomly  occuring  demands  is  inventory.  Virtually  all  inventory 
decisions  are  affected  by  the  uncertainties  associated  with  demands.  There  are 
numerous  models  and  decision  rules  available  for  prescribing  such  things  as  the 
times,  amounts,  and  types  of  items  for  order  provided  the  probability  structure 
of  the  demands  is  known.  Richards  and  Thomas  [15]  applied  the  maximum  entropy 
concept  to  select  an  appropriate  family  of  demand  distributions  to  be  used  in  . 
policy  level  decisions  for  a large  scale  multi-item  inventory  system.  In  [15] 
only  moment  constraints  in  the  form  of  (2)  were  considered.  Again,  more  realistic 
input  information  can  be  accommodated  for  evaluating  these  and  related  inventory 
decisions . 

Queueing  Systems. 

Akin  to  the  inventory  problem  are  queueing  problems  that  arise  from  arriving 
customers  or  items  having  to  await  service  (see  Rosenshine  [16]  for  a recent  re- 
view) . Although  queueing  theory  provides  numerous  developments  for  evaluating 
systems,  most  of  the  literature  is  devoted  to  ramifications  of  fixed  and  known 
arrival  processes,  service  processes,  and  operating  conditions.  For  the  develop- 
ment and  design  of  new  systems,  however,  one  is  usually  trying  to  select  or  pre- 
scribe the  best  choice  of  service  system  and  related  activities  for  a given  arrival 
process.  There  is  a need  for  exploring  such  procedures  as  the  GMEP  for  this  pur- 
pose. 

Reliability 

A fourth  area  of  potential  application  is  in  evaluating  the  reliability  of 
operating  systems.  While  reliability  theory  for  the  most  part  has  evolved  about 
hardware  components  and  systems,  there  is  current  interest  in  adopting  similar 
quantitative  techniques  for  human  error  evaluation.  It  is  recognized  that  most 
systems  warranting  analysis  are  man-machine  systems  and  the  human  aspects  of  the 
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problem  cannot  be  overlooked.  Unfortunately,  the  human  data  tends  to  be  more 
variable  and  sensitive  to  environmental  conditions.  Regulinski  [14]  has  proposed 
using  maximum  entropy  concepts  to  deal  with  this  problem.  A similar  and  related 
problem  area  is  the  relatively  new  "software  reliability"  area. 

Indeed  these  four  areas  of  application  are  not  exhaustive.  Many  problems 
arise  in  the  analysis  and  evaluation  of  systems  that  require  descriptions  of 
random  phenomena  and  hence  one  or  more  probability  distributions  must  be  assumed. 
The  amount  of  information  available  for  supporting  such  assumptions  is  generally 
quite  limited.  Obviously,  to  some  decision  makers  and  for  certain  applications 
the  rationality  of  the  ME  Principle  might  be  questionable.  For  cases  where  this 
rationality  is  acceptable,  however,  the  generalization  described  here  is  useful 
for  making  the  best  utilization  from  available  known  facts  in  analyzing  decisions 
under  uncertainty. 
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APPENDIX 


ME  DISTRIBUTIONS  WITH  KNOWN  MOMENTS 


The  formulation  of  the  problem  of  determining  the  ME  distribution  with 
known  moment  constraints  is  given  in  (2) . Although  this  optimization  problem 
may  be  solved  for  either  cases  of  continuous  or  discrete  X , the  results 
follow  directly  by  inspection.  It  is  known  (see  Kullback  [11])  that  for  an 
exponential  family  of  probability  functions 


f(x|6)  = c(0)  h(x)  exp[J  7T  (0)  T (x)]  , 

i 1 1 


where  0e0  , the  set  of  parameters,  and 


c(0)  *=  l h(x)  exp [ J ir  (0)  T (x)] 
x i 1 1 


for  discrete  X (and  the  obvious  analog  for  the  absolutely  continuous  case) , 
is  a maximum  entropy  distribution  whenever  the  constraints  in  (2)  are  of  the 


E[T±(x)]  = u . 


Thus  one  can  derive  the  ME  distribution  by  merely  factoring  the  pdf  or  dpf 
as  the  case  may  be.  Table  1 gives  some  classical  ME  pdf's  and  associated 
constraints.  Some  discrete  probability  functions  are  given  in  Table  2. 


• •*,  4 * * . ■ y . 


TABLE  1.  Some  Classical  Maximum  Entropy  Probability  Density  Functions 


Normal  £0(x) 


TABLE  2.  Some  Classical  Maximum  Entropy  Discrete  Probability  Functions. 


ln{x!  ] 


TABLE  2.  (Cont'd.) 


emann  Zeta  Function 
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